Knowledge Discovery from Symbolic Data and the Sodas Software
Abstract
The descriptions of data units are called “symbolic” when they are more complex than standard ones because they contain internal variation and are structured. Symbolic data arise from many sources, for instance when huge relational databases are summarised by their underlying concepts. “Extracting knowledge” means obtaining explanatory results, which is why “symbolic objects” are introduced and studied in this paper. They model concepts and constitute an explanatory output for data analysis. Moreover, they can be used to define queries on a relational database and to propagate concepts between databases. We define “Symbolic Data Analysis” (SDA) as the extension of standard data analysis to symbolic data tables as input, with symbolic objects as output. In this paper we give an overview of recent developments in SDA. We present some tools and methods of SDA and introduce the SODAS software prototype, which resulted from the work of 17 teams from nine countries involved in a European project of EUROSTAT.
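As a rough illustration of the idea described in the abstract, the following minimal Python sketch summarises individual rows of a relational table into one symbolic description per concept, using interval-valued and set-valued variables, and then uses such a description as a membership query. This is not the SODAS API: the function names `summarise` and `matches`, the variable names, and the sample rows are all hypothetical and invented for illustration.

```python
from collections import defaultdict

# Hypothetical individual-level rows (invented for illustration only).
rows = [
    {"town": "A", "age": 23, "job": "clerk"},
    {"town": "A", "age": 41, "job": "teacher"},
    {"town": "B", "age": 35, "job": "clerk"},
    {"town": "B", "age": 58, "job": "clerk"},
]


def summarise(rows, concept_key, numeric, categorical):
    """Aggregate rows into one symbolic description per concept."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[concept_key]].append(row)

    table = {}
    for concept, members in groups.items():
        desc = {}
        for var in numeric:
            values = [m[var] for m in members]
            desc[var] = (min(values), max(values))  # interval-valued variable
        for var in categorical:
            desc[var] = {m[var] for m in members}   # set-valued variable
        table[concept] = desc
    return table


def matches(row, desc):
    """Return True if an individual row fits a symbolic description."""
    for var, d in desc.items():
        if isinstance(d, tuple):          # interval: check lower/upper bounds
            lo, hi = d
            if not lo <= row[var] <= hi:
                return False
        elif row[var] not in d:           # set: check membership
            return False
    return True


symbolic_table = summarise(rows, "town", ["age"], ["job"])
```

Under these assumptions, `symbolic_table["A"]` describes town A by an age interval and a set of jobs rather than by a single value per variable, and `matches(row, symbolic_table["B"])` plays the role of a query that tests whether a new individual falls within the description of concept B.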
Similar resources
An introduction to symbolic data analysis and the SODAS software
The data descriptions of the units are called "symbolic" when they are more complex than standard ones because they contain internal variation and are structured. Symbolic data arise from many sources, for instance when huge relational databases are summarised by their underlying concepts. "Extracting knowledge" means obtaining explanatory results, which is why "symbolic objects" are...
Modélisation des données boursières à l'aide de l'analyse de données symboliques
In this paper we present a model of the stock exchange domain using symbolic data analysis, and we use the SODAS software to analyze this domain. After a short presentation of the software, we present the analysis in three steps: choice of the symbolic objects, their definition, and their analysis with SODAS. We give details for each of these steps and underline their importance. Two examples...
Symbolic Analysis of Financial Data
We take as input two portfolios of stocks managed by ING bank (North Holland). Each portfolio and each stock is described by several variables on every open day of the week over eight weeks. A portfolio is characterised by variables such as its value and its performance; a stock is characterised by variables such as its value, its number in the portfolio, and its performance. In this paper, we study a symboli...
A review of data mining applications in the health system
Introduction: The extensive amounts of data stored in medical databases require the development of specialized tools for accessing the data, data analysis, knowledge discovery, and the effective use of the data. Data mining is one of the most important of these methods. The article sketches the data mining techniques used and illustrates their applicability to medical diagnostic and prognostic problems. ...
Knowledge discovery from patients’ behavior via clustering-classification algorithms based on weighted eRFM and CLV model: An empirical study in public health care services
The rapid growth of information technology (IT) drives and creates competitive advantages in the health care industry. Nowadays, many hospitals try to build successful customer relationship management (CRM) to recognize target and potential patients, increase patient loyalty and satisfaction, and ultimately maximize their profitability. Many hospitals have large data warehouses containing customer ...